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Abstract. In this paper we study the accuracy of pho- 
tometric redshifts computed through a standard SED fit- 
ting procedure, where SEDs are obtained from broad-band 
photometry. We present our public code hyperz, which is 
presently available on the web. We introduce the method 
and we discuss the expected influence of the different 
observational conditions and theoretical assumptions. In 
particular, the set of templates used in the minimization 
procedure (age, metallicity, reddening, absorption in the 
Lyman forest, ...) is studied in detail, through both real 
and simulated data. The expected accuracy of photomet- 
ric redshifts, as well as the fraction of catastrophic identifi- 
cations and wrong detections, is given as a function of the 
redshift range, the set of filters considered, and the pho- 
tometric accuracy. Special attention is paid to the results 
expected from real data. 
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1. Introduction 

The estimate of redshifts through photometry is one of the 
most promising techniques in deep universe studies, and 
certainly a key point to optimize field surveys with large- 
field detectors. It is in fact an old idea of Baum (1962), 
who originally applied it to the measure of redshifts for 
elliptical galaxies in distant clusters. It was later used by 
several authors in the eighties (Couch et al. 1983, Koo 
1985) on relatively low-redshift samples, observed in the ~ 
4000 to 8000 A domain. Later in the nineties, the interest 
for this technique has increased with the development of 
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large field and deep field surveys, in particular the Hubble 
Deep Field North and South (HDF-N and HDF-S). 

Basically two different photometric redshift techniques 
can be found in the literature: the so-called empirical 
training set method, and the fitting of the observed Spec- 
tral Energy Distributions (hereafter SED) by synthetic 
or empirical template spectra. The first approach, pro- 
posed originally by Connolly et al. (1995, 1997), derives 
an empirical relation between magnitudes and redshifts 
using a subsample of objects with measured spectroscopic 
redshifts, i.e. the training set. A slightly modified version 
of this method was used by Wang et al. (1998) to de- 
rive redshifts in the HDF-N by means of a linear function 
of colours. This method produces small dispersions, even 
when the number of filters available is small, and it has 
the advantage that it does not make any assumption con- 
cerning the galaxy spectra or evolution, thus bypassing 
the problem of our poor knowledge of high redshift spec- 
tra. However, this approach is not flexible: when different 
filter sets are considered, the empirical relation between 
magnitudes and redshifts must be recomputed for each 
survey on a suitable spectroscopic subsample. Moreover, 
the training set is constituted by the brightest objects, 
for which it is possible to measure the redshift. Thus, this 
kind of procedure could in principle introduce some bias 
when computing the redshifts for the faintest sources, be- 
cause there is no guarantee that we are dealing with the 
same type of objects from the spcctrophotometrical point 
of view. Also, the redshift range between 1.4 and 2.2 had 
been hardly reached by spectroscopy up to now, because 
of the lack of strong spectral features accessible to optical 
spectrographs. Thus, no reliable empirical relation can be 
found in this interval. 

The SED fitting procedure, described in detail in the 
following section, bases its efficiency on the fit of the over- 
all shape of spectra and on the detection of strong spectral 
properties. The observed photometric SEDs are compared 
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to those obtained from a set of reference spectra, using 
the same photometric system. The photometric redshift 
of a given object corresponds to the best fit of its photo- 
metric SED by the set of template spectra. This method 
is used mainly for applications on the HDF, using either 
observed or synthetic SEDs (e.g. Mobasher et al. 1996, 
Lanzetta et al. 1996, Gwyn & Hartwick 1996, Sawicki et 
al. 1997, Giallongo et al. 1998, Fernandez-Soto et al. 1999, 
Arnouts et al. 1999, Furusawa et al. 2000). A crucial test 
in all cases is the comparison between the photometric and 
the spectroscopic redshifts obtained on a restricted sub- 
sample of relatively bright objects. A combination of this 
method with the Bayesian marginalization introducing a 
prior probability was proposed by Bemtez (2000). 

The aim of this paper is to explain in a straightforward 
way the expected performances and limitations of photo- 
metric redshifts computed from broad-band photometry. 
This study has been conducted with our public code called 
hyper -z, which adopts a standard SED fitting method, but 
most results should be completely general in this kind of 
calculations. This program was originally developed by 
Miralles (1998) (see also Pello et al. 1999), and the present 
version of the code hyperz is available on the web at the 
following address: 

http : //webast . ast . obs-mip . f r/hyperz . 

The plan of the paper is the following. In Section 2 
we present the method used by hyperz and the involved 
set of parameters. The accuracy of the redshift determina- 
tions and the expected percentage of catastrophic identifi- 
cations, as a function of the filter set and the photometric 
errors, are studied through simulations in Section 3. The 
influence of the different parameters on the accuracy of 
photometric redshifts is investigated in Section 4, using 
both simulations and spectroscopic data from the HDF. 
Section 5 is devoted to the analysis on the expected ac- 
curacy and possible systcmatics when exploring real data, 
coming from deep photometric surveys. A general discus- 
sion is given in Section 6 and conclusions are listed in Sec- 
tion 7. 

2. The method 

Photometric redshifts (hereafter z p hot) are based on the 
detection of strong spectral features, such as the 4000 A 
break, Balmcr break, Lyman decrement or strong emission 
lines. In general, broad-band filters will allow to detect 
only "breaks" , and they are not sensitive to the presence 
of emission lines, except when their contribution to the 
total flux in a given filter is higher or of the same order 
of photometric errors, as it happens in the case of AGNs 
(Hatziminaoglou et al. 2000). 

The method used in this paper to compute photomet- 
ric redshifts is a SED fitting through a standard x 2 mini- 
mization procedure, computed with our code hyperz. The 
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Table 1. Characteristics of filters used in the simulations: the 
effective wavelength A e ff and the surface of the normalized re- 
sponse function. 



observed SED of a given galaxy is compared to a set of 
template spectra: 
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where F b s ,i, -Ftemp,i and Oi are the observed and template 
fluxes and their uncertainty in filter i, respectively, and b 
is a normalization constant. 

The new Bruzual & Chariot evolutionary code (GIS- 
SEL98, Bruzual & Chariot 1993) has been used to build 
8 different synthetic star-formation histories, roughly 
matching the observed properties of local field galaxies 
from E to Im type: a delta burst, a constant star-forming 
system, and six /x-models (exponentially decaying SFR) 
with characteristic time-decays chosen to match the se- 
quence of colours from E-S0 to Sd. We use the Initial Mass 
Function (IMF) by Miller & Scab (1979), but this choice 
has a negligible impact on the final results, as discussed 
in Section 4.6. The upper mass limit for star formation 
is 125 Mq. The basic database includes only solar metal- 
licity SEDs, but other possibilities are discussed in Sec- 
tion 4. The library also includes a set of empirical SEDs 
compiled by Coleman, Wu and Weedman (1980) (here- 
after CWW) to represent the local population of galaxies. 
CWW spectra were extended to wavelengths A < 1400 A 
and A > 10000 A using the equivalent GISSEL spectra. 
The synthetic database derived from Bruzual & Chariot 
includes 408 spectra (51 different ages for the stellar popu- 
lation and 8 star- formation regimes). In most applications, 
there is no sensible gain when the number of /i-models is 
reduced to only 3, thus including only 255 spectra. 

Throughout this paper we use the same set of broad- 
band filters, with characteristics presented in Table 1. 
These filters cover all the wavelength domain under study, 
without major overlap or gap. We also include the HDF 
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filters used in Sections 4 and 5 (from Biretta et al. 1996). 
The hyperz filter library is an enlarged version of the orig- 
inal Bruzual & Chariot one, and presently includes 163 fil- 
ters and detector responses. All magnitudes given in this 
paper refer to the Vega system. 

Hyperz has been optimized to gain in efficiency when 
computing z p hot on large catalogues. The input data for 
a given catalogue are magnitudes and photometric errors. 
To compute a reliable estimate of z p hot: the colours and 
the corresponding photometric errors must be obtained 
with particular care, including uncertainties due to zero- 
points, intrinsic accuracy, etc. Magnitudes are obtained 
within the same aperture in all filters, after correction for 
seeing differences between images. For a given catalogue, 
the relevant parameters introduced in the z p hot calculation 
are: 

— The set of template spectra. This point includes the 
SFR type, the possible link between the age and the 
metallicity of the stellar population, and the choice of 
an IMF. It is discussed in Section 4. 

— The reddening law is usually taken from Calzetti et 
al. (2000), but 4 other laws are also included in the 
code. This is discussed in Section 4.4. The input value 
is Ay, corresponding to a dust-screen model, with 
F (X) = Fi(A)10-°- 4AA , where F Q and F, are the ob- 
served and the intrinsic fluxes, respectively. The ex- 
tinction at a wavelength A is related to the colour 
excess Eb-v and to the reddening curve fc(A) by 
A x = k(X)E B -v = k{X)A v /R, with R = 3.1 except 
for the Small Magellanic Cloud (R = 2.72) and the 
Calzetti's law (R = 4.05). The normal setting for Ay 
ranges between and 1.5 magnitudes. The mean galac- 
tic extinction correction towards a given line of sight 
can be introduced in terms of Eb-v, and it is applied 
to the whole catalogue. 

— Flux decrements in the Lyman forest are computed 
according to Giallongo & Cristiani (1990) and Madau 
(1995), both of them giving similar results. 

— The limiting magnitude in each filter, and the rule to 
be applied in the case of non detection. The rule is set 
for each filter independently, and there are 4 different 
possibilities: 0) the filter is not taken into account in 
the computation; 1) the flux in this filter is set to with 
an error bar corresponding to the flux deduced from 
the limiting magnitude; 2) the flux in this filter is set to 
1/2 of the limiting flux, according to the limiting mag- 
nitude, and the associated 1 sigma error is ±1/2 times 
this value; 3) the flux and the 1 sigma error in this filter 
are computed from the limiting magnitude and from 
the error associated to the limiting magnitude (both 
fixed). Case 1 is the usual setting when one is dealing 
with a relatively deep survey in the considered filter, 
whereas case applies to "out-of-field" objects. Case 
2 and 3 are well suited for relatively shallow surveys. 
The idea of "shallow" and "deep" in this context refers 



to relative values of the limiting magnitudes associated 
to the different filters in the photometric catalogue. 
— The cosmological parameters Ho, fio and Ct\, which 
are only related here to the maximum age allowed 
to the stellar population at a given redshift. The age 
checking is an option. 

Due to the degeneracy in the parameter space defined 
by the SFR type, age, metallicity and reddening, the z p hot 
computation for a given object is equivalent to finding the 
most likely solution for the redshift across this parameter 
space, regardless to details on the best-fit SED (see Fig- 
ure 1). Both the z p hot and the SED are obtained through 
hyperz, together with the best fit parameters {Ay, spec- 
tral type, metallicity and age). Because of the degeneracy 
between these parameters, the relevant information shall 
be the redshift and the rough SED type, in the sense that 
a given object has a "blue" or "red" continuum at a given 
z, but no reliable information can be obtained about the 
other parameters from broad-band photometry alone. 




redshift 



Fig. 1. Artist view of the SED fitting procedure to compute 
Zphot- The figure presents a likelihood map for a representa- 
tive object at z ~ 4. The shaded area encloses the highest 
confidence level region according to the x 2 associated proba- 
bility. Each point on the redshift-age map corresponds to the 
best fit of the SED obtained across the parameter space. The 
degeneracy in the parameter space is shown in this example. 



3. Filters and photometric accuracy 

In this section we study through simulations the quality 
of the z p hot as a function of the filter set, the photometric 
accuracy and the redshift, i.e. the robustness of the red- 
shift determination and the expected percentage of catas- 
trophic identifications and spurious detections. The aim 
of this exercise is to study the systematic effects produced 




Fig. 2. Comparison between z mo dei and Zphot for simulated catalogues with Am = 0.1 and filters sets BVRI, UBVRI, UBVRIJ, 
UBVRIK, BVRIJK, UBVRIJHK. Dotted lines correspond to A z = 0.2, dashed lines to A z = 0.5 and thin solid lines to A z = 1. 
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filters 



BVRI 



UBVRI 



Am 

0.05 
0.10 
0.20 
0.30 
0.05 
0.10 
0.20 
0.30 



0.18 
0.20 
0.29 
0.25 
0.07 
0.09 
0.20 
0.28 



0.0 - 0.4 

0.00 
-0.05 
-0.19 
-0.34 
-0.03 
-0.03 
-0.11 
-0.20 



0.4 - 1.0 

Oz (A;) 

-0.13 
-0.21 
-0.24 
-0.24 
-0.07 
-0.11 
-0.19 
-0.18 



0.22 
0.30 
0.37 
0.38 
0.17 
0.21 
0.29 
0.31 



1.0 - 2.0 

Oz (Az) 

0.10 
0.08 
0.18 
0.25 
0.12 
0.17 
0.17 
0.16 



0.32 
0.38 
0.41 
0.42 
0.26 
0.35 
0.42 
0.49 



2.0 - 3.0 

Oz (A z ) 

0.07 
0.10 
0.01 
-0.21 
0.04 
0.11 
0.12 
0.11 



0.30 
0.45 
0.54 
0.53 
0.21 
0.33 
0.41 
0.45 



3.0 - 5.0 

Oz (Az) 

0.08 
0.10 
0.14 
0.14 
0.05 
0.08 
0.09 
0.11 



0.23 
0.25 
0.34 
0.35 
0.17 
0.23 
0.27 
0.29 



5.0 - 7.0 

Oz (Az) 

0.05 
0.07 
0.07 
0.09 
0.06 
0.04 
0.06 
0.07 



0.16 
0.19 
0.23 
0.28 
0.18 
0.19 
0.23 
0.27 



UBVRI Z 



0.05 
0.10 
0.20 
0.30 



0.04 
0.07 
0.17 
0.21 



-0.01 
-0.02 
-0.08 
-0.12 



0.11 
0.16 
0.22 
0.27 



-0.05 
-0.08 
-0.11 
-0.15 



0.25 
0.28 
0.41 
0.44 



0.11 
0.11 
0.11 
0.12 



0.13 
0.23 
0.34 
0.40 



0.04 
0.07 
0.16 
0.18 



0.17 
0.22 
0.28 
0.32 



0.06 
0.09 
0.12 
0.15 



0.09 
0.14 
0.19 
0.24 



0.01 
0.02 
0.05 
0.08 



UBVRIJ 



0.05 
0.10 
0.20 
0.30 



0.04 
0.08 
0.17 
0.22 



-0.01 
-0.01 
-0.09 
-0.13 



0.07 
0.11 
0.19 
0.26 



-0.02 
-0.06 
-0.08 
-0.15 



0.11 
0.20 
0.30 
0.35 



-0.01 
-0.01 
0.00 
0.01 



0.12 
0.14 
0.23 
0.30 



0.07 
0.10 
0.14 
0.17 



0.12 
0.17 
0.26 
0.30 



0.04 
0.07 
0.13 
0.17 



0.11 
0.15 
0.18 
0.22 



0.02 
0.03 
0.03 
0.06 



UBVRI K 



0.05 
0.10 
0.20 
0.30 



0.04 
0.08 
0.17 
0.22 



-0.01 
-0.02 
-0.064 
-0.14 



0.07 
0.10 
0.16 
0.25 



-0.02 
-0.05 
-0.08 
-0.14 



0.11 
0.22 
0.31 
0.36 



0.00 
0.01 
0.01 
-0.01 



0.21 
0.27 
0.32 
0.35 



0.07 
0.12 
0.18 
0.20 



0.13 
0.18 
0.22 
0.26 



0.04 
0.08 
0.12 
0.14 



0.15 
0.15 
0.21 
0.23 



0.05 
0.06 
0.08 
0.09 



BVRIJK 



0.05 
0.10 
0.20 
0.30 



0.06 
0.12 
0.19 
0.25 



-0.01 
-0.03 
-0.06 
-0.14 



0.06 
0.11 
0.23 
0.26 



-0.03 
-0.05 
-0.07 
-0.14 



0.16 
0.24 
0.33 
0.40 



0.00 
0.00 
-0.04 
-0.04 



0.16 
0.19 
0.27 
0.32 



0.01 
0.02 
0.06 
0.09 



0.13 
0.18 
0.23 
0.27 



0.03 
0.07 
0.11 
0.16 



0.09 
0.12 
0.16 
0.20 



0.03 
0.04 
0.07 
0.09 



UBVRI J K 



0.05 
0.10 
0.20 
0.30 



0.04 
0.09 
0.18 
0.23 



-0.01 
-0.01 
-0.05 
-0.09 



0.05 
0.09 
0.18 
0.24 



-0.02 
-0.03 
-0.06 
-0.10 



0.13 
0.21 
0.32 
0.33 



0.02 
0.03 
0.03 
0.05 



0.06 
0.11 
0.20 
0.28 



0.00 
0.02 
0.05 
0.08 



0.12 
0.15 
0.18 
0.21 



0.03 
0.05 
0.09 
0.11 



0.09 
0.12 
0.15 
0.19 



0.02 
0.03 
0.05 
0.07 



UBVRI JHK 



0.05 
0.10 
0.20 
0.30 



0.03 
0.10 
0.18 
0.25 



0.00 
-0.02 
-0.06 
-0.12 



0.05 
0.10 
0.17 
0.25 



-0.01 
-0.02 
-0.07 
-0.11 



0.10 
0.20 
0.28 
0.33 



0.00 
-0.01 
-0.02 
-0.04 



0.06 
0.13 
0.19 
0.27 



0.00 
0.03 
0.06 
0.09 



0.09 
0.13 
0.19 
0.23 



0.01 
0.04 
0.09 
0.11 



0.09 
0.11 
0.16 
0.21 



0.02 
0.03 
0.06 
0.09 



Table 2. Summary of results obtained on simulated catalogues with a homogeneous redshift distribution as a function of the 
redshift bin, filters set and photometric errors Am. See the text for a complete description. 



by the sampling of the SED and the associated noise com- 
ing from photometry. Catastrophic identifications (1%) are 
those with |A Z | = |z mo dd — z p hot| > 1, and such objects 
are thus lost from their original redshift bin. The accuracy 
of -Zphot in a given redshift bin is defined by the mean dif- 
ference (A z ) = A z /iV of the sample with respect to the 
model redshift, excluding catastrophic identifications, and 
the standard deviation a z = \/^(A z — (A Z )) 2 /(N — 1). 
Spurious identifications (g%) correspond to objects which 
are incorrectly assigned to a given z p hot interval, and thus 
susceptible to contaminate the statistics within this z p hot 
interval; in this case |A Z | > 3 x a z . 

Some of these quantities, in particular 1% and g%, de- 
pend on assumptions about redshift number counts and 
photometric depth. For this reason we compute them only 
for a set of simulations with a more realistic modeling for 
galaxy counts, according to a Pure Luminosity Evolution 



(PLE) scenario. We discuss the results as a function of the 
photometric parameters in Section 5. 

Simulated catalogues of 1000 objects were produced, 
with a homogeneous redshift distribution, in order to com- 
pute the above mentioned parameters as a function of the 
filter set and photometric accuracy. In all cases, the types 
and ages assigned to the different galaxies in a redshift bin 
are randomly chosen from the 8 GISSEL98 template fam- 
ilies mentioned above, with solar metallicity. Photometric 
errors in these homogeneous catalogues are introduced as 
a noise following a Gaussian distribution of fixed la in 
magnitudes for each band (0.05 to 0.3 magnitudes, i.e. ~ 
5 to 30 % photometric accuracy), and they are uncorre- 
cted for different filters. For each filter set we study the 
quality of z p hot as a function of the photometric accuracy, 
In this particular case, photometric errors do not scale 
with magnitudes. A realistic error distribution is used in 
Section 5. The value of the visual extinction Ay ranges be- 
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tween and 1. For each simulated galaxy, hyperz computes 
a Zphot value, as well as the z p hot error bars corresponding 
to P = 68, 90, 99% confidence levels, computed by means 
of the Ax 2 increment for a single parameter (Avni 1976). 
The redshift step used to search solutions between z = 
and z = 7 is Az = 0.05, with an internal accuracy which is 
10 times better. The choice of the primary z-step between 
0.1 and 0.05 does not affect significantly the results. 

Figure 2 shows the behaviour of the different sets of 
simulated samples when the z p hot is compared to the true 
Zmodei- The results of these simulations are summarized in 
Table 2. Without near-IR photometry, the errors on indi- 
vidual galaxies become huge at 1.2 < z < 2.2 as expected 
due to the lack of strong spectral features in the visible 
band. In particular, in this redshift range the 4000 A break 
goes out of the I band and the Lyman break does not 
yet affect the photometry in the filter U. This problem 
is solved when near-IR is included. In fact, J, H, and K 
filters allow to bracket the 4000 A break. Also, the lack of 
U band photometry introduces an enhanced uncertainty 
in the z < 0.4 domain (mainly because of the contribution 
at 0.2), because at z < 0.2 none of the other filters is 
able to detect a strong break. 

All these results are almost independent on the type 
of galaxy, provided that the evolving population of stars 
is older than a few <~ 10 7 years typically. This point is 
discussed in details in next section. 

The dispersion in z p hot is strongly sensitive to the 
photometric uncertainties. There is no significant gain 
for Am < 0.05 magnitudes (about 5% accuracy). This 
value roughly corresponds to the typical photometric un- 
certainties in deep photometric surveys, when all the er- 
ror sources are included. The dispersion and the num- 
ber of multiple solutions with similar weight rapidly in- 
crease up to Am ~ 0.3 magnitudes. Including near-IR 
JHK photometry strongly reduces the error bars within 
the 1.2 < z < 2.2 range, without significantly improving 
the uncertainties in z p hot outside this interval. If the filter 
Z is considered in addition to the five optical filters, the 
resulting dispersion at low redshift become smaller up to 
Zmodd — 1.5, but the degeneracy at z mo dci = 1.5-3 still 
remains, even if less dramatic. 

In Figure 3 we illustrate the probability functions for 
two simulated galaxies at low and high redshift: the solu- 
tion becomes better constrained around the model value 
and the degeneracy between high and low redshift solu- 
tions disappears with increasing photometric accuracy and 
when the wavelength range extends up to the near infrared 
region. 

The typical dispersion in z p hot obtained here is simi- 
lar to the values found in the literature, even when the 
techniques used are appreciably different (Brunner et al. 
1997, Connolly et al 1997, ...). In most published studies 
it is extremely difficult to compare the accuracy of z p hot 
as a function of photometric errors. 



These results are useful to understand the general 
trends expected from a given configuration of filters and 
photometric accuracy. Nevertheless, z p hot techniques are 
often applied to statistical studies, which require more "re- 
alistic" simulations in order to define the right observa- 
tional strategy for the photometric survey. Then, a realis- 
tic redshift distribution is needed. For most applications, a 
PLE model is enough to determine the main trends. Also, 
photometric uncertainties have to be scaled with magni- 
tude, to reproduce the behaviour of real catalogues. These 
points are discussed in Section 5. 

4. Influence of the different parameters on z p hot 
accuracy 

4-1. Templates and Lyman forest blanketing 

We have studied the influence of the set of templates used 
on the final results through a comparison between the hy- 
perz z p hot determinations and real spectroscopic data on 
HDF. All the other parameters are fixed in this case, and 
the only difference is the set of templates used to com- 
pute Zphot- Table 3 summarizes these results. A similar 
blind test was recently performed by Hogg et al. (1998) 
on a sample of HDF-N galaxies at z < 1.4, using different 
procedures and, in particular, different sets of templates. 

We have computed photometric redshifts for the 
sample of 108 galaxies on the HDF-N with observed 
^spec (Cohen et al. 1996; Cowie 1997; Zepf et al. 1996; 
Steidel et al. 1996; Lowcnthal et al. 1997) consid- 
ered by Fernandez-Soto et al. (1999) plus 4 galaxies 
from HDF-S (Glazebrook et al. 2000, in preparation). 
Among these, 83 galaxies are at z < 1.5 and 29 at 
2 < z < 6. Photometry was obtained from the Stony 
Brook's group (Fernandez-Soto et al. 1999, SUNY web 
pages http://www.ess.sunysb.edu/astro/hdfs) using 
the package SExtractor (Bertin & Arnouts 1996) to detect 
sources, and consists in 7 filters for the HDF-N (F300W, 
F450W, F606W, F814W plus near infrared photometry 
in JHK filters obtained by Dickinson et al. 1998 at the 
KPNO IRIM camera) and 12 for the HDF-S (F300W, 
F450W, F606W, F814W, plus an additional shallow opti- 
cal catalogue UBVRI from NTT SUSI2, and near infrared 
JHK data obtained with NTT SOFI). Here we consider 
results obtained using the 7 filters for the HDF-N galaxies 
and all the 12 available filters for the four objects of the 
HDF-S subsample. Calculations on the HDF-S using 7 fil- 
ters do not affect significantly the individual photometric 
redshift and the overall statistic. 

To calculate magnitudes from the available measured 
fluxes in the catalogues, we considered as non-detection 
criterion a signal-to- noise ratio S/N < 1. In this case we 
assigned a magnitude = 99 and we used the information 
about the limiting magnitude in the involved filter. 

Three different sets of templates are considered in this 
section: the basic 5 GISSEL98 models with solar metallic- 
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Fig. 3. Examples of the evolution of the probability distributions associated to X 2 ( z ) as a function of the filter set and 
photometric errors, for two simulated objects. Left: z mo dei = 0.282. Right: z mo de\ = 2.396. Dotted lines refer to Am = 0.3, 
dashed lines: Am = 0.2, solid lines: Am = 0.1. The vertical line marks the true z m odci value. 



ity mentioned above (1 delta burst, 3 /^-decaying, 1 con- 
stant star-formation system), the CWW set of empirical 
SEDs, and the CWW set extended with a SED of a very 
blue galaxy taken from GISSEL library (Miller & Scalo 
IMF, constant SFR, age = 0.1 Gyr). Adding new very blue 
spectra to the third set does not change perceptibly the re- 
sults. As for the simulated catalogues, we search solutions 
in the redshift interval z = - 7 with a step Az = 0.05. 
In all crude limit in absolute magnitude has been 

imposed to compute z p hot, with Mb G [—28,-9]. More- 
over, we checked the age of the template to be consistent 
with the age of the universe at the considered redshift, de- 
pending on the cosmological model. Here we use Qq — 1, 
£1a = and Hq — 50kms _1 Mpc -1 . The reddening is as- 
sumed to range from Ay — to 1.2, following the Calzetti 
et al. (2000) law. 

The comparison between z spcc and z p hot for the 112 
galaxies of the sample is shown in Figure 4, for the three 
sets of templates hereafter referenced as (a), (b) and (c), 
respectively. Each of them produces a fairly good agree- 
ment with the measured spectroscopic redshifts, but no- 
ticeable differences appear when considering the values of 
the dispersion, computed as 



c G \ y lj — 1 |^phot,2 ^spec.il^ 1 

z = TTJz) = V n^i — TT^) ' 

in the two redshift domains [z < 1.5 and 2 < z < 6): 

— for z spcc < 1.5 ((z) ~ 0.65) we found: (a) S z — 
0.09; (b) 0.21; (c) 0.17. If we exclude objects with 
I Zphot — z spec \ > 0.5, the value of S z reduces to 0.06 
in case (a) (81 objects), and to 0.08 for (b) and (c) (81 
and 82 objects respectively). In case (a), these rejected 
objects correspond to the two galaxies with uncertain 



spectroscopic redshifts (see Arnouts et al. 1999). Thus, 
using GISSEL models in this redshift domain produces 
more accurate results than CWW templates alone. 
— in the high redshift domain, the dispersion in all the 
considered cases is 5 Z = 0.26 ((z) ~ 3.06). If we remove 
catastrophic identifications (2 objects), characterized 
by l^phot — z spcc | > 1, then 5 Z = 0.10 in case (a), 0.08 in 
(b) and 0.07 in (c). In this case, the CWW set produces 
slightly better results than GISSEL, probably due to a 
Lyman blanketing effect. This point is discussed below. 

In general, the reasons of failures can be ascribed to 
many effects, such as a wrong photometry (systematic er- 
rors when measuring magnitudes or underestimated pho- 
tometric errors) leading to a highly unlikely fit, or a prob- 
ability function with significant secondary peaks, because 
of degeneracy among the fit parameters, or a relatively 
"flat" probability function due to a lack of sufficient pho- 
tometric information. The last explanation applies partic- 
ularly to the object at z spec = 5.64, which is detected only 
in filter F814W and which is at the limit of detection in 
F450W, with S/N ~ 1.5. However, if we use all the avail- 
able photometry, disregarding the S/N criterion, we ob- 
tain Zphot = 5.13. The object at z S pcc = 2.931 is placed at 
low redshift by other groups (Fernandez-Soto et al. 1999, 
Arnouts et al. 1999). Nevertheless a secondary peak, with 
a very small x 2 probability, is found at z = 2.90. 

We can remark that at high redshift the cases (b) 
and (c) are better centred around the spectroscopic value. 
However, their \ 2 values are higher than in case (a). The 
reason suspected for that is the one-to-one relation in- 
troduced here between the Lyman-forest absorption and 
the redshift. We investigate this problem by assigning dif- 
ferent values to the Lyman-forest decrement, multiplying 
the values of the mean line blanketing (Da) and (Db) 
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Fig. 4. Comparison between photometric and spectroscopic redshifts for the HDF spectroscopic sample. Error bars in z p hot 
correspond to 3<r. Different sets of template spectra are used in the various panels: (a) the basic 5 G1SSEL98 models with solar 
metallicity; (b) the CWW set of empirical SEDs; (c)the CWW set extended with a SED of a very blue galaxy. The lower panels 
(d), (e) and (f) present the comparison between template sets of different metallicities. See text for more details. 



provided by Madau (1995) by a factor 0.5 and 1.5, then 
increasing or decreasing the absorption (Furusawa et al. 
2000). We found a better fit to the HDF data when the Ly- 
man forest along the line of sight produces a smaller flux 
decrement with respect to the mean value. In this case we 
obtain 5 Z = 0.09 for the GISSEL case (a), a value which 
is similar to the value of CWW SEDs. An overestimate 
of absorption due to neutral hydrogen induces a subse- 
quent and systematic underestimate of redshifts, because 
the same attenuation of the flux could be reproduced with 
a solution at lower redshift. Hence a careful knowledge of 
the UV region of SEDs is essential to accurately assess 
Zphot! furthermore, the Lyman forest represents the most 
important signature of spectra in the high redshift regime. 
Thus it is important to allow the blanketing in the Lyman 
forest to span a sufficiently wide range of values in order to 
prevent systematic effects at high-z, which could depend 
on the line of sight. 



It is worth to notice that, even if all the template 
SEDs reproduce the spectroscopic redshifts on the HDF 
with sufficient accuracy, the redshift distributions of galax- 
ies could change significantly when we are dealing with 
objects fainter than the spectroscopic limits, for which 
no training set is available. When the redshift distribu- 
tion obtained on the HDF with CWW templates is com- 
pared with the equivalent one computed with GISSEL 
templates, there are no strong differences in the overall 
distribution. Nevertheless, this result could not apply to 
all cases. A straightforward example is the case of a deep 
photometric survey using visible filters only, without near- 
IR photometry, and designed to probe the low surface- 
brightness regime. It is easily shown that, in this case, 
a degenerate solution could exist for the faintest "blue" 
sources, for which it is impossible to decide between a 
low-z solution (low surface-brightness object with a very 
young stellar population, as presented in next subsection) 
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Table 3. Summary of dispersions in the HDF z p hot measurements for the different sets of templates, where 8 Z — a/ [1 + (z)], us- 
ing the Calzetti's law with Av ranging from to 1.2 magnitudes. The total number of objects considered in each non-catastrophic 
sample is given in brackets. See more details in text. 



and a relatively bright 1 < z < 2.5 galaxy, with ongo- 
ing star-formation (no strong signatures on a continuum 
increasing bluewards). In that case, using the CWW tem- 
plates alone will tend to select the later solution systemat- 
ically, whereas including templates spanning a wide range 
of ages for the stellar population (such as GISSEL) could 
select the former solution, thus leading to a completely 
different redshift distribution. We prefer to adopt a rela- 
tively large number of GISSEL's templates, to supply a 
wide baseline for modeling the age effects, rather than to 
assume the evolution reproduced by the transformation in 
a different local spectral type. 

4-2. Age of the stellar population 

Photometric redshifts are efficient when a spectral feature 
is detected through the filters with an important strength 
as compared to photometric uncertainties. When we are 
dealing with the stellar continuum of a young stellar pop- 
ulation, the 4000 A break becomes visible at ~ 10 7 years 
(see Bruzual & Chariot 1993). In most cases, this lack of 
strong features could not be compensated by the presence 
of strong emission lines, simply because such lines have 
a negligible effect on the integrated energy when using 
broad-band filters (see Section 4.7). 

In order to study the effects of age on z p hot estimates 
as a function of redshift, we have produced different sets of 
catalogues corresponding to different ages, all of them with 
a uniform distribution in z for the delta burst SED (single 
stellar population model). Figure 6 displays the general 
trends of z p hot versus z mo dci for representative ages and 
the UBVRIJHK set of filters. In this case the set of tem- 
plates used is the basic GISSEL one with solar metallicity. 
At z mc ,dei ^ 3, the redshift determination is accurate for 
any age because of the presence of Lyman break in the 
filter U. At smaller redshifts, z p hot is based on the 4000 A 
break as the strongest spectral signature, and it is visible 
only in systems which are a few ~ 10 7 years old. 
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Fig. 5. The dispersion cr z /(l + (z)) as a function of the mean 
redshift of the considered range. Ages are 10 6 ,10 7 ,10 8 ,10 9 and 
10 10 yr from top to bottom. 

The results obtained applying hyperz to these cata- 
logues are summarized in Figure 5, where we show the 
effect described above by means of the dispersion in four 
redshift bins: the value of a z / ( 1 + (z) ) decreases increasing 
the redshift and the age of the stellar population. 

4-3. Cosmology 

The effects of cosmological parameters (Hq, Slo and ^a) 
are only related to the age allowed to the stellar popu- 
lation at a given redshift. When using hyperz, the age 
of the stellar population can be optionally limited to the 
age range permitted by the cosmological parameters. In 
order to quantify such effect on Zphot, if any we have 
compared the results previously obtained on the HDF 
(with the crude age limitation given above) with those 
obtained without age constraints, and also with a differ- 





Fig. 6. Comparison between z mo dei and z p hot for simulated catalogues of single burst galaxies with Am — 0.1, filters set 
UBVRIJHK and ages of galaxies 10 6 , 10 7 , 10 8 and 10 9 yr. 



ent set of cosmological parameters (f2rj = 0.3, J7a = 0.7 
and Hq = SOkms^ 1 Mpc" 1 ). These results show that the 
effect of the cosmological parameters on the z p hot estimate 
is negligible, because they affect S z by less than 1%. 

4-4- Reddening 

The five reddening laws presently implemented in hyperz 
are: 

1. Allen (1976) for the Milky Way (MW); 

2. Seaton (1979) fit by Fitzpatrick (1986) for the MW; 

3. Fitzpatrick (1986) for Large Magellanic Cloud (LMC); 

4. Prevot et al. (1984) and Bouchet et al. (1985) for Small 
Magellanic Cloud (SMC); 

5. Calzetti et al. (2000) for starburst galaxies. 



The different laws are presented in Figure 7. 

Recent studies on high redshift galaxies and star for- 
mation obscured by dust have shown the importance of 
reddening in the high-z universe. In order to probe this is- 
sue on Zphot computations, we have compared the results 
previously obtained on the HDF to those obtained assum- 
ing no reddening, all the other parameters being fixed. We 
found S z = 0.13(0.07 without catastrophic objects) for the 
\ow-z bin and S z = 0.50(0.13) for the high-z one, but with 
a much higher percentage of catastrophic identifications: 
10 objects at z spcc ~ 3 are erroneously identified as low 
redshift galaxies. 

Therefore, keeping a wide range of reddening values 
seems to be essential to reproduce the SEDs of high red- 
shift galaxies. According to Steidel et al. (1999), the typi- 



M. Bolzonella et al.: Photometric z based on standard SED fitting procedures 



11 



5 



1 

i \ 
"\ \ 
■\ \ 
■ \ \ 

:\\\ 


i 

Allen (MW) ' 

Seaton (MW) 

_ Fitzpatrick (LMC) 
Prevot (SMC) 

_ Calzetti (SB) '. 


■ > \\ \ 
i 





MA] 

Fig. 7. Extinction curves fe(A) for the different reddening laws 
implemented in hyper z. 



cal Eb-v f°r galaxies to z ~ 4 is 0.15 mags, thus Ay — 0.6 
mags when using a Calzetti's law. The maximum Ay al- 
lowed in our calculations is about 2 times this value. 

Moreover, we conducted a test to study the influence 
of the different reddening laws, using all the implemented 
possibilities. We found that the laws reproducing the ex- 
tinction of the Milky Way and the Large Magellanic Cloud 
are not appropriate to fit the SEDs of high redshift galax- 
ies (z sp ec > 2), whereas they leave the low redshift region 
unaffected. Instead, the fourth law, corresponding to the 
Small Magellanic Cloud, produces results similar to those 
obtained with the curve provided by Calzetti et al. (2000). 
It correctly assigns the z p hot to the high redshift objects, 
but it places a couple of low z spcc objects at higher z p hot- 
The last effect is probably due to the higher and steeper 
fc(A) at short wavelength as compared to Calzetti's, which 
mimics the additional effect of the UV attenuation induced 
by the Lyman forest. At high redshift, the most important 
wavelength region is the UV, between 1000 A and 3000 A, 
where the considered laws give quite different trends, thus 
modifying in a different way the magnitudes and produc- 
ing different values of % 2 . In fact, most of the fits to the 
HDF sample using reddening laws from 1 to 4 produce 
worse x 2 values than the Calzetti's law, in particular for 
those objects requiring Ay > 0.6. These galaxies cannot 
be reproduced by the MW and LMC laws, even when the 
limit of Ay is increased up to Ay = 2. 

Thus, the slope of the selected reddening law at short 
wavelengths must be defined carefully; the extrapolation 
used here to extend the laws 1 to 4 towards wavelengths 
not covered by data is rather poor. These considerations 
get stronger evidence that the modeling of the UV region 
of SEDs is essential to recover correctly the high z galaxies. 
The re-emission of energy coming from dust heated by 



massive star formation does not affect the present results, 
because we concentrate on the UV to near-IR bands. 

4-5. Metallicity 

We have also checked the influence of the metallicity on 
the Zphot estimates using the same HDF training sample. 
The same computations have been done using different 
and extreme assumptions for the metallicity of the stel- 
lar population, with values ranging from 0.005Z Q to 5Z & 
(as allowed by Bruzual & Chariot's models). We have also 
developed a self-consistent set of templates, where the evo- 
lution in metallicity of the stellar population is explicitly 
taken into account (cf. Mobasher & Mazzei 1999). In other 
words, there is a natural link between the age of the stel- 
lar population and its mean metallicity. For all metallic- 
ity cases, we have built up the same closed-box systems 
presented before: a constant star-forming galaxy and six 
/z-models. 

Three sets of templates were considered: the 3 different 
metallicities together (solar and the 2 extreme values) , the 
two extreme values alone, and the self-consistent model. 
A comparison among all these cases is given in Figure 4 
(d,e,f). The dispersions at low redshift without failed ob- 
jects are S z = 0.05,0.06,0.05 respectively, for the 3 differ- 
ent sets. At high redshift we found S z — 0.11,0.10,0.10, 
under the same assumptions. A slight improvement on the 
accuracy of z p hot at z ^ 1.5 is observed when several differ- 
ent metallicities are used together, and the self-consistent 
model (f) produces the best fit in this redshift range. On 
the other side, including different metallicities does not 
affect the high redshift determinations. 

4-6. Initial Mass Function 

The influence of the IMF has also been tested on the HDF 
spectroscopic sample. We have used the self-consistent 
modeling, which takes into account the evolution in metal- 
licity of the stellar population and produces the best fit to 
the HDF data when using the Miller & Scalo IMF (1979). 
We have built up the same closed-box models for 2 ad- 
ditional IMFs, Salpeter (1955) and Scalo (1986), keeping 
the same upper mass limit for star formation. When ap- 
plying these new templates to the HDF sample, we find 
exactly the same results in terms of z p hot accuracy. Look- 
ing more carefully to the results obtained for individual 
objects, we find that the z p hot estimates are approxima- 
tively the same, whatever the IMF used. This result is easy 
to understand because the changes induced on the stellar 
continuum by the different IMF slopes are compensated in 
most cases by the other parameters (reddening, age, ...), 
thus giving the same z p hot result but a different solution 
in the parameter space. 

When we compute z p hot on simulated data, the z p hot 
accuracy is the same when we use a unique IMF in model 
galaxies and templates and when we use a different IMF 
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in both settings. In addition, we have checked on possi- 
ble systematic changes on the spectral types derived by 
hyperz in the later case, with negative results. In particu- 
lar, a model catalogue built with Miller & Scalo IMF was 
analysed with Salpeter and Scalo IMFs, and the results 
were the same as in the Section 4.8 below. This strength- 
ens the idea of the IMF being a secondary parameter in 
•Zphot estimates. 



4-7. Emission lines 

As long as we are dealing here with broad-band photom- 
etry, the presence of emission lines on the spectra has a 
relatively small effect on the integrated fluxes, and thus 
a small influence on the z p hot results. This can be easily 
quantified when we consider the sample of blue compact 
galaxies at z < 1.4 studied by Guzman et al. (1997), and 
the samples of star-forming galaxies described by Cowie 
et al. (1995), Glazebrook et al. (1995) and Tcrlcvich et al. 
(1991). At relatively low redshift, the main emission lines 
to consider are [On]A3727, Ha, H/3 and [Om]AA4959, 5007, 
[Oil] and Ha being the most important contributions to 
the integrated fluxes. According to Guzman et al. (1997), 
the [On]A3727 luminosity of star- forming galaxies can be 
approximated by L([0n]) ~ 10 29 T / FjQ TT jis, where W^Oii] 
is the equivalent width and Lb is the blue luminosity in 
solar units. 

For our purposes, an emission line can be overlooked 
when /(e-line)// A < I - 10-°- 4Am , where /(e-line) and f\ 
are, respectively, the integrated fluxes within the emission 
line and the stellar continuum through the filter, and Am 
is the photometric uncertainty in magnitudes. A realistic 
value of Am ~ 0.05 to 0.1 mags (~ 5 to 10% uncertainty) 
imposes /(e-linc)//A < 0.05 to 0.1. The limit in equivalent 
width for galaxies in the Guzman et al. sample is a few 
times 100 A, thus most compact star-forming galaxies ful- 
fill this condition. Even when we consider the typical lumi- 
nosities of vigorous star-forming sources (L([On]) ~ 10 42 
erg/s, Cowie et al. 95, Glazebrook et al. 1995), emission 
lines arc found to be negligible in most of them. Also the 
large majority of Hn galaxies in the Terlevich et al. (1991) 
local sample fulfill the condition. 

Thus, emission lines do not seem to influence signifi- 
cantly the Zphot results on star-forming galaxies. On the 
contrary, this is not the general case when we are dealing 
with AGNs, or when the photometry is obtained through 
narrow-band filters. We have not considered here neither 
the contribution of AGN to the simulated samples, nor the 
influence of such templates on the final accuracy when we 
are dealing with real data. AGN SEDs could be easily 
introduced in our present scheme, and this particular ap- 
plication is presently under development (Hatziminaoglou 
et al. 2000). 



4-8. Recovering the main SED parameters through hyperz 

As mentioned before, hyperz allows to obtain the z p hot 
and the best fit parameters across the whole space. The 
fitting procedure does not favour any parameter in partic- 
ular. The homogeneous simulations presented in Section 3 
could be used to briefly discuss on the efficiency to recover 
the most relevant input parameters: the spectral types, the 
age of the stellar population and Ay. Because of the de- 
generacy between these parameters, and the lack of spec- 
tral resolution, we only expect a rough spectral type to be 
retrieved from broad-band photometry. We have consid- 
ered the 8 spectral types presented in Section 2 to illus- 
trate the case. A general trend appears when comparing 
the model and retrieved spectral types, whatever the red- 
shift, filter combination and photometric accuracy, with 
single bursts and early types being more easily identified 
than late types at all redshifts. Figure 8 displays an exam- 
ple obtained with the UBVRIJK filter combination and 
10% photometric accuracy, excluding catastrophic iden- 
tifications (less than 1% in this case). The trend remains 
the same whatever the distribution in types, from these de- 
tailed 8 types to a rough Burst-E/S/Im distribution. Low- 
ering the S/N or the number of filters slightly increases the 
trend in terms of contrast between the early type and late 
type behaviour. Late type misidentifications are due to 
the degeneracy between age of the stellar population and 
spectral type, such galaxies being incorrectly assigned to 
younger and earlier types. In other words, there is often 
a burst-like template, of suitable age and length, which 
is able to fit the dominant stellar population of a galaxy 
observed through broad-band filters. The results are the 
same whatever the configuration in the parameter space, 
in particular, changing the order or the position of the dif- 
ferent templates in the space produces the same results. 
Degenerate solutions in the redshift dimension are system- 
atically displayed by hyperz, but this is only an option for 
the other dimensions of the parameter space. There is no 
systematic trend in the case of catastrophic identifications, 
but more than 90% of such objects in these simulations 
have misidentified spectral types as well. 

In the case of Ay, the procedure will choose the 
best and the lowest possible value. The results in this 
case are much better, whatever the S/N, provided that 
near IR filters are included. Using a grid of AAy = 
0.2 to explore the parameter space, the typical value 
of a AV = VXXAav - {Aav)) 2 / {N - 1) ranges between 
o~av = 0.15 and 0.3, for photometric accuracies between 
5 and 30%, for all the filter combinations including J, H 
or K (or a combination of them). In all the other cases, 
o~av = 0.3 to 0.45, for photometric accuracies between 
5 and 30%. These values are an average through all the 
spectral types and redshifts, excluding catastrophic iden- 
tifications. Similar estimates on catastrophic objects show 
an increase between +0.05 and +0.3 on cfav, depending 
on the filter set. 
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Fig. 8. Comparison between the model spectral types and the 
best-fit templates recovered by hyperz, for the simulations com- 
puted in Section 3. Spectral types, from 1 to 8, correspond to 
galaxies ranging from early (Bursts/E) to late (Im) types. Cir- 
cle sizes scale with the number of objects. Ideally only the 
diagonal region should have been populated. 



In summary, it is difficult to obtain detailed informa- 
tion on the spectral types from broad-band photometry 
alone, and this is probably the result of the poor spectral 
resolution. Near IR photometry allows to constraint the 
Ay value for all spectral types. Only early type galaxies 
could be reliably identified by this method. For later types, 
only a rough estimate of the SED type could be obtained, 
in terms of "blue" or "red" continuum. The classification 
in this case shall cither include the spectral type and the 
age of the stellar population, or be based on a simple set 
of templates such as CWW. 

5. Expected accuracy on real data 

In order to discuss on the expected accuracy and possi- 
ble systematic errors when exploring real data, we have 
performed a complete set of simulated catalogues, with 
a more realistic (non uniform) redshift distribution and 
S/N ratio along the SEDs. We have adopted the simple 
PLE model proposed by Pozzetti et al. (1996, 1998), with 
minimal changes, to derive the redshift distributions and 
to assign a magnitude to each object in the different fil- 
ters. Four galaxy types (exponentially decaying SFR with 
characteristic time r = 1 Gyr and lOGyr, constant SFR 
with evolution in time and at a fixed age of 0.1 Gyr) and 
their corresponding luminosity functions are used to re- 
produce the number of galaxies expected at a given red- 
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Table 4. Limiting magnitudes at S/N = 1: (d) deep pencil 
beam-like survey, (s) shallow ground-based survey. 



shift and absolute magnitude M^. Apparent magnitudes 
are computed from the evolved SEDs, with ages depend- 
ing on the redshift considered, the formation redshift, set 
to Zform = 7, and the cosmological parameters. Photo- 
metric errors are scaled to apparent magnitudes assum- 
ing the approximate relation Am ~ 2.51og[l + 1/(S/N)], 
where S/N is the signal to noise ratio, which is given as 
a function of the apparent magnitude through S/N = 
(S/N) Q l0~ OA ( m - mo \ (S/N) being the signal to noise ra- 
tio at a given reference magnitude mo- For simplicity, the 
photometric error is assigned to the apparent magnitude 
m according to a Gaussian distribution of Am fixed a. 
This relation is set to reproduce the rapid increase of 
uncertainties when approaching the limiting magnitudes. 
According to these equations, a value of S/N = 10, corre- 
sponding to Am ~ 0.1, is reached 2.5 magnitudes brighter 
than the magnitude corresponding to S/N — 1. An ob- 
ject with S/N < 1 is non-detected in the involved filter 
(m = 99). An object is included in the final catalogue if 
it is detected in the filter / (assuming that this is the se- 
lection filter), and in at least two other filters. The last 
requirement is needed to compute z p hot- 

The same filter combinations discussed in Section 3 
have been used to produce the new simulated catalogues. 
The simulations in Section 3 represent an ideal case, with 
an infinite depth and a fixed photometric error, disregard- 
ing the dependence on errors versus magnitudes. However, 
the relevant quantities a z , 1% and g% strongly depend on 
the number of objects in each redshift bin and then on the 
limiting magnitudes. 

To give a qualitative idea of the accuracy expected 
with different observational configurations, we consider 
two representative cases. 

5.1. Deep pencil beam surveys 

Firstly, we focus on simulations obtained in the case of a 
pencil beam-like survey, i.e. a very deep observation, cov- 
ering a small area. From the photometric point of view, 
the main improvement with respect to the uniform dis- 
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tributions presented above is that we can introduce, for 
each object, a realistic S/N in the different filters, with 
different values from filter to filter. We assume that the 
detection limit is reached (S/N = 1) at magnitudes sim- 
ilar to the limiting magnitudes of the HDF, as reported 
in the column my lm (d) of Table 4 and in the right part of 
the same table for the HDF-N filters. To obtain approxi- 
mately the same number of galaxies observed in the HDF, 
a field of 5 arcmin 2 has been simulated. The percentages 
of spectral types included in the simulated catalogue are 
- 11,58,28,3% for E, Sb, Im, and Im(age = 0.1 Gyr) 
respectively. In order to reproduce the observed number 
counts at faint magnitudes (Williams et al. 1996) we as- 
sume an open cosmological model, with f2o = 0.1 and 
£1a = 0. In this case, the peak of the redshift distribution 
is at z <; 1 and very few objects are seen at low-z, in par- 
ticular at redshifts between z = and z — 0.4. Moreover, 
the PLE model is known to overestimate the population 
of high redshift galaxies. 

In Table 5 we display the computed quantities o~ z , 1% 
and g% for the set of filters of the HDF-N, and for all the 
other deep survey combinations considered in Section 3 
(marked by (d) in the second column). The table contains 
the dispersion and the percentages of spurious and catas- 
trophic objects, computed from a set of 10 independent 
simulations for each configuration. The interpretation of 
data in Table 5 must take into account that the definition 
of g% depends on the dispersion a z computed using the 
correctly assigned objects, and this quantity is quite sen- 
sitive to the different filter sets and redshift bins. Never- 
theless, these simulations take properly into account the 
observed properties of galaxies in deep surveys, such as 
the presence of faint objects with huge photometric er- 
rors, and the lack of detection in some filters leading to 
an uncertain z p hot estimate (that is, increasing the prob- 
ability of misidentifications, enlarging the error bars and 
the dispersion around the true value). In particular, this 
effect is evident when looking at the trend of the 1% val- 
ues. At higher redshift we find an increasing number of 
faint objects that are non detected in some filters. This 
lead to an increase of 1%. Because of the depth of limit- 
ing magnitudes, we adopt the non detection law number 
1 for optical filters and 2 for the near infrared ones. In the 
highest redshift bins, the increase of the dispersion value 
tends to mitigate the effect of the z p hot deterioration in 
the value of g%. 

In the case of HDF-N filter set, we considered also two 
subcatalogues built with more restrictive selection crite- 
ria, requiring the detection both in 7gi4 and in at least two 
other filters to be S/N > 5 and S/N > 10. Statistics con- 
cerning these simulations are tabulated in Table 5. Obvi- 
ously, when considering objects with increasing S/N, the 
accuracy of z p hot estimate significantly improves. In Fig- 
ure 9 we show the results obtained on the comparison be- 
tween Zphot and model redshifts, and also on the iV(-z m odei) 




1 2 3 4 5 



model 



100 

N 

50 







HDF+JHK 


1 1 1 1 1 






r l 
















r~ Jin 








-J III 

, LJ 

1 — i 1 










i r 1 





















1 2 3 4 5 

z 

Fig. 9. Top: comparison between z mo dei and z p hot for realistic 
catalogue HDF-like. See the text for the considered limiting 
magnitudes. Small dots, crosses and circles correspond to ob- 
jects brighter than (S/N)n m — 1,5, 10 respectively, at least in 
the / filter and in two other filters. Bottom: redshift distribu- 
tions for the simulation on the top with (S/N)n m = 1. Solid 
line: N(z mode i). Dashed line: N(z phot ). 

versus N(z p hot) reconstruction. Most of the discrepancy is 
due to objects with S/N < 10. 

5.2. Shallow wide field surveys 

In the second case, the aim is to reproduce the observa- 
tional conditions reached when using 8 m telescopes and 
a wide field detector. In particular, we consider the case 
of a survey in a ~ 60 arcmin 2 field, observed with all the 
filter sets considered in Section 3. The adopted limiting 
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filters 




z = 0.0 - 0.4 


0.4 - 1.0 


1.0 - 2.0 


2.0 - 3.0 


> 3 


cr z /% 3% 


<J Z 1% g% 


a z 1% g% 


a z 1% g% 


ct z 1% g% 


S/N = 5 
S/N = 10 




06 0+0 3 + 7 
0.06 ±0 3±7 
0.06 ±0 3±7 


17 1+1 8 + 3 
0.09 + 8 + 2 
0.07 0±0 4±2 


99 1 + 1 19 + 9 
0.20 + 4 + 2 
0.15 + 2 + 1 


39 8 + 1 4+1 
0.30 1 + 1 0+1 
0.20 + + 


34 30 + 3 1 +1 
0.30 2 + 2 + 
0.19 + + 


BVRI 


(d) 


0.08 10 ±8 38 ±15 
017 30 + 3 10 + 9 


0.32 8 + 2 40 + 4 
36 1 1 + 9 7+1 


0.46 19 + 1 10 + 2 
45 1 8 + 1 6 + 1 


0.56 34 + 3 4+1 
70 58 + 7 39 + 6 


0.47 31 + 6 7+1 


UBVRI 
S/N — 1 
S/N = 5 
S/N = 10 


(d) 

(s) 
( s ) 


0.06 0±0 37+ 13 
15 17 + 1 95 + 3 
0.09 5 + 1 7 + 2 
0.07 1 + 1 2 + 2 


0.30 7 + 1 21 + 4 
34 7 + 1 4+1 

XJ.O^l 1 _l_ J- rt _L_ ± 

0.26 2 + 1 2 + 1 
0.18 1 + 1 1 + 1 


0.43 12 + 2 10 + 2 
49 1 9 + 9 9 + 1 
0.49 14 + 6 2 + 1 
43 4 + 7 4 + 5 


0.50 17 + 2 4+1 
58 41 + 1 1 4 + 7 


0.46 28 + 3 5 + 1 


UBVRIZ 


(d) 

( s ) 


0.06 + 20 + 10 
0.14 18 + 1 25 + 2 


0.23 6 + 2 28 + 3 
29 11 + 1 7+1 


0.45 12 + 2 2 + 1 
0.53 21 + 2 1 + 1 


0.43 12 + 2 11 ±1 
42 41 + 9 35 ± 8 


0.48 24 + 2 6 + 2 


UBVRIJ 


(d) 
f s ) 


0.05 + 13 + 9 
17 11 ± 1 19 + 2 

U» i- 1 J- J- — 1— -L i *J — 1 — — 


0.19 3 + 1 25 + 3 
25 5 ± 1 4+1 


0.37 5 + 1 6 + 1 
45 11 + 2 2 + 1 

V / . 1 ' ' 11 — 1 — — — — 1_ 1 


0.42 15 + 2 6 + 2 
50 35 ± 8 28 + 7 


0.43 28 + 3 2 + 2 


UBVRIK 


(d) 
(s) 


0.06 + 9 + 11 
0.17 14 + 1 14 + 2 


0.19 3 + 1 22 + 2 
0.25 7 + 1 4+1 


0.36 5 + 1 7+1 
0.43 11 + 2 2 + 1 


0.44 14+1 2 + 1 
0.54 34 + 6 25 + 7 


0.39 14 + 3 3 + 2 


BVRIJK 


(d) 


0.06 + 7 + 6 
019 17 + 9 6 + 9 


0.17 3 + 1 32 + 3 
94 7 + 1 9 + 1 


0.39 5 + 2 9 + 1 
39 8 + 1 4+1 


0.43 25 + 3 4 + 1 
n 07 qa + 6 58 + 8 


0.40 29 + 2 3 + 3 


UBVRIJK 
S/N = 1 
S/N = 5 
S/N = 10 


(d) 
( s ) 

(s) 
(s) 


0.06 + 5 + 5 
0.17 13 + 2 12 + 3 
0.09 2 + 1 6 + 2 
0.06 + 2 + 2 


0.17 3 + 1 23 + 4 
0.24 6 + 1 3+1 
0.15 1 + 1 3+1 
0.11 + 2 + 1 


0.35 3 + 1 8 + 1 
0.40 6 + 2 3 + 1 
0.40 8 + 2 2 + 1 
0.27 7+ 10 5 + 4 


0.41 13 + 3 4+1 
43 29 + 7 34 + 9 


0.38 26 ± 4 2 ± 2 


UBVRI 
JHK 


(d) 
(s) 


0.05 + 5 + 6 
0.17 13 + 2 9 + 2 


0.17 3 + 1 24 + 2 
0.24 5 + 1 3+1 


0.35 3 + 1 8 + 1 
0.38 5 + 1 4+1 


0.39 13 + 2 6 + 1 
0.36 23 + 4 41 + 8 


0.38 27 + 3 2 + 1 



Table 5. The dispersion a z and the percentage of catastrophic and spurious objects, 1% and g%, with errors, in five redshift 
bins, computed from 10 realizations of simulated catalogues with a redshift distribution derived from a PLE model, (s) and (d) 
refer to shallow and deep surveys respectively. The data are replaced by a dash when there are no enough data to compute the 
statistics. For the three examples in Figures 9, 10 and 11 we also present the quantities mentioned above as a function of the 
limiting signal to noise ratio considered for the detection. 



magnitudes are shallower and conservative with respect 
to the values in the previous simulations. They are shown 
in the column mii m (s) of Table 4. The percentages of the 
different spectral types for catalogues with these limiting 
magnitudes, using the same detection criteria, are similar 
to the previous ones, being ~ 12, 54, 26, 8% for E, Sb, Im, 
and Im(age = 0.1 Gyr) respectively. 

Results for cr z , 1% and g% are presented in Table 5 
and marked by a (s). For the filter sets UBVRI and UB- 
VRIJK we repeated the same procedure adopted for the 
HDF-N simulated catalogue, to build two subcatalogues 
with higher S/N thresholds. Figure 10 presents the re- 
sults obtained with the five optical bands only, whereas 
Figure 11 displays the equivalent results with the addi- 
tional photometry in two near infrared filters. Figures 10 
and 11 show the associated input and recovered redshift 
distributions. 

The peak of the redshift distribution in this case is at 
a lower redshift compared to the HDF simulation. Wide- 
field surveys allow to obtain a better sampling of the 
bright end of the luminosity function with respect to HDF- 
like surveys, the later being more suited to explore the 
faint luminosity regime. The value of g% and 1% change 



significantly when considering the same set of filters, but 
a different kind of survey. On the contrary, a z remains 
similar. 

An interesting feature is the opposite trend displayed 
by deep pencil-beam compared to shallow wide-field sur- 
veys with respect to the low and high redshift regimes for 
a given filter set. At low redshifts, the value of g% is larger 
for the deep pencil-beam survey (type (d) in the Table 5) 
than for the shallow (s) wide-field one. Conversely, the ac- 
curacy of deep surveys overcome that of the shallow ones 
at high redshifts. In this context, the separation between 
low and high redshift regimes is marked by the z = 1 - 2 
bin. 

This behaviour could be easily explained when we con- 
sider the different characteristics of the catalogues pro- 
duced in the two cases. The deep survey catalogue con- 
tains few low redshift galaxies, and most of them derive 
from the faint tail of the luminosity function. These faint 
galaxies are much more abundant than the bright ones, 
than they are present in the catalogue even though the 
volume covered at low redshift by this survey is small. The 
photometric errors for these intrinsically faint objects are 
rather large, thus causing a poor estimate of z p hot ■ On the 




Fig. 10. Top: comparison between z mo dei and z p hot for realistic 
catalogue for a shallow survey with 5 niters. The symbols are 
the same as in Figure 9. Bottom: redshift distributions for the 
simulation on the top with (S/N)n m = 1. Solid line: N(z mo dei)- 
Dashed line: N(z p i lot ). 



Fig. 11. Top: comparison between 2 mo dei and z p hot for realistic 
catalogue for a shallow survey with 7 filters. The symbols are 
the same as in Figure 9. Bottom: redshift distributions for the 
simulation on the top with (S/N)-^^ = 1. Solid line: -/V(z mo d e i)- 
Dashed line: N(z p hot)- 



contrary, the wide-field survey contains a large quantity 
of bright galaxies at low redshift, wich have sufficiently 
small photometric errors to obtain accurate £ p hotS. The 
faintest objects are lost in this case because of the shallow 
detection limits. The majority of galaxies in the shallow 
wide survey lies in the low redshift bins, around the peak 
of N(z). When we consider the population of galaxies be- 
yond the peak of the redshift distribution, the photomet- 
ric errors in the shallow survey become important and an 
increasing fraction of objects is non detected in various 
filters. These problems hamper a robust determination of 
-Zphot- On the contrary, the pencil beam survey take ad- 



vantage of its depth, allowing to compute Zphot at higher 
redshifts. 

On the basis of these results, we caution that the kind 
of analysis presented here is strongly advised when a pho- 
tometric survey is undertaken in view of computing z p hotS. 
In particular, the filter configuration and the photometric 
depth to reach in each filter have to be determined accu- 
rately in advance, in order to optimize the survey and to 
study the feasibility of the project. 
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6. Discussion 

Making use of the Bayesian technique, Bcmtez (2000) 
demonstrated that the dispersion of z p hot can be signif- 
icantly improved. Despite of this result, we decide not to 
introduce this possibility in our code, at least for general 
purposes. The reason for this is that we want to prevent 
spurious effects in particular studies. As an example, when 
the luminosity function is imposed, the study of the galaxy 
population is constrained and it becomes impossible to 
obtain independent information on the properties of ob- 
jects, thus limiting the possible applications. However, this 
method can be regarded with interest when the purpose is 
addressed to some specific application or when one is deal- 
ing with poor data, in such a way that the introduction of 
hints permits to obtain useful results. Alternatively, the 
photometric redshift estimate can be safely improved in- 
troducing the Bayesian inference when prior information 
is not related to photometric properties of sources. Exam- 
ples of such priors that could be combined with the z p hot 
technique are the morphology or the clues inferred from 
gravitational lensing modeling. 

One of the main issues for z p hot is the optimization 
of the visible versus near-IR bands for spectroscopic sur- 
veys. The aim is to produce a criterion based in z p hot to 
discriminate between objects showing strong spectral fea- 
tures in the optical and in the near-IR. To perform this 
test, both the redshift and the SED characteristics have to 
be estimated for each object. The z p hot and the SED are 
obtained by means of hyperz, together with the best fit 
parameters (Ay, spectral type, metallicity and age). The 
relevant information shall be the redshift and the rough 
SED type, i.e. "blue" or "red" continuum at the given z. 
We have shown that only limited information could be 
obtained on the parameter space from broad-band pho- 
tometry alone. This situation will change with the future 
cryogenic imaging spectrophotometers, as presented in a 
recent paper by Mazin & Brunner (2000), because such 
devices will be able to gain in spectral resolution while 
spanning a large wavelength domain. 

Another important issue for z p hot is the improvement 
on the cluster detection in wide-field photometric surveys. 
Including such a technique in an automated identification 
algorithm, whatever this algorithm is, allows to improve 
significantly the detection levels. The main idea is that 
the contrast between the cluster and the foreground and 
background population is the leading factor. When intro- 
ducing a simple detection scheme, similar to the one used 
by Cappi et al. (1989), it is easy to quantify this effect 
(Pcllo et al. 1998). In general, the S/N is expected to im- 
prove by a factor of at least ~ 2 to 3 with respect to the 
pure 2D case, depending on the cluster redshift and rich- 
ness, the set of filters used and the depth of the survey. 
When considering more elaborated cluster-finding algo- 
rithms, such as the one produced by Kepner et al. (1999), 
Olscn et al. (1999), Scodeggio et al. (1999), Kawasaki et 



al. (1998) or Dcltorn et al. (2000, in preparation), these 
results could be regarded as the relative improvement due 
to photometric redshifts. The present version of hyperz is 
also able to display the probability of each object to be at 
a fixed redshift. This is useful when looking for clusters of 
galaxies at a given (guessed) redshift. 

The study of clustering properties through the spatial 
correlation function of galaxies, using the angular corre- 
lation together with the z p hot information is another pos- 
sible application of z p hot, aiming to extend the study of 
galaxy properties to fainter limits in magnitude. In this 
case, the relatively high number of objects accessible to 
photometry per redshift bin, suitably defined according 
to photometric redshift accuracy, allows to enlarge the 
spectroscopic sample towards the faintest magnitudes, and 
also to strongly reduce the errors (because the number 
of objects per redshift bin strongly increases). Studies on 
the evolution of the angular correlation function of galax- 
ies in the HDF-N applying the photometric redshift tech- 
nique can be found in Miralles & Pello (1998), Connolly ct 
al. (1998), Roukcma et al. (1999), Arnouts et al. (1999), 
Magliocchetti & Maddox (1999), . 

The same slicing procedure can be adopted to study 
the evolution of the luminosity function and consequently 
to infer the star formation history at high redshift from 
the UV luminosity density, as well as to analyse the stel- 
lar population and the evolutionary properties of distant 
galaxies (e.g. Yee et al. 1996, SubbaRao et al. 1996, Gwyn 
& Hartwick 1996, Sawicki et al. 1997, Connolly ct al. 1997, 
Pascarelle et al. 1998, Giallongo et al. 1998). 

Furthermore, the photometric redshift method has 
been used to investigate the nature of Extremely Red Ob- 
jects (EROs) with a "spectro-photometric" technique by 
Cimatti et al. (2000), deducing clues about the model of 
galaxy formation. Another kind of spectroscopic and pho- 
tometric combination has led to the identification of very 
high redshift object, as described by Chen et al. (1999). 

From this not exhaustive list of applications, it is evi- 
dent that photometric redshifts are a powerful and promis- 
ing tool in many areas of extragalactic research. This 
method shall not be regarded only as a "poor person's 
redshift machine" , but as a fundamental instrument, since 
a multitude of faint objects will remain beyond the limits 
of spectroscopy for the next years. Even with the diffusion 
of Multi-Object Spectrometers, most of the faint galaxies 
with measured photometry will fall beyond the reach of 
conventional spectroscopy. 

7. Conclusions 

We have presented the characteristics and the perfor- 
mances of our public code hyperz, available on the web, 
which make use of the template SED fitting technique. 
We can summarize the main conclusions as follows: 

1. Simulations of ideal catalogues have shown the main 
trends of the accuracy on z p hot calculations. In par- 
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ticular, z p hot estimates are improved when the filters 
set spans a wide wavelength range, including near-IR 
and U filters, and when the photometric errors become 
small. 

2. We have investigated the weight of the different param- 
eters on the final results, using both a spectroscopic 
subsamplc of HDF and simulations. In particular, the 
templates, the flux decrement by Lyman forest, the age 
of the stellar population, the reddening, the cosmology, 
the metallicity, the IMF and the presence of emission 
lines have been discussed. According to these results, 
the Zphot preciseness seems to be more sensitive to the 
photometric accuracy rather than to the detailed set of 
parameters. Nevertheless, a subset of these parameters 
(reddening, age of the stellar population and Lyman 
forest blanketing) has to span a sufficiently wide range 
of values to obtain accurate ZphotS- 

3. The robustness of the method has been illustrated 
through realistic deep field simulations, aiming to re- 
produce the redshift distribution, photometric accu- 
racy and limiting magnitudes encountered in deep field 
surveys. 

4. We have pointed out some of the manifold applications 
of the photometric redshift machinery in present and 
future projects. 

5. We plan to include AGN SEDs in the present scheme 
of hyperz, as well as stellar templates, in order to au- 
tomatically classify objects in a photometric survey 
through a unique pipeline. This particular application 
is presently under development (Hatziminaoglou et al. 
2000). 
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